Column-Type Prediction for Web Tables Powered by Knowledge Base and Text

Abstract

Web tables are essential for applications such as data analysis. However, web tables are often incomplete and lack some critical information, which makes it challenging to understand the table content. Automatically predicting column types for tables without metadata is important when dealing with the variety of tables found on the Internet. This paper proposes a CNN-Text method to deal with this task, which fuses CNN prediction and voting processes. We present augmentation and synthetic generation approaches to improve the CNN's performance, and we use extracted text to get better predictions. The experimental results show that CNN-Text outperforms the baseline methods, demonstrating that it is well qualified for column-type prediction.

Similar resources

Scalable Column Concept Determination for Web Tables Using Large Knowledge Bases

Tabular data on the Web has become a rich source of structured data that is useful for ordinary users to explore. Due to its potential, tables on the Web have recently attracted a number of studies [6, 18] with the goals of understanding the semantics of those Web tables and providing effective search and exploration mechanisms over them. An important part of table understanding and search is c...

Towards a Large Corpus of Richly Annotated Web Tables for Knowledge Base Population

Web Table Understanding in the context of Knowledge Base Population and the Semantic Web is the task of i) linking the content of tables retrieved from the Web to an RDF knowledge base, ii) of building hypotheses about the tables’ structures and contents, iii) of extracting novel information from these tables, and iv) of adding this new information to a knowledge base. Knowledge Base Population...

Population of a Knowledge Base for News Metadata from Unstructured Text and Web Data

We present a practical use case of knowledge base (KB) population at the French news agency AFP. The target KB instances are entities relevant for news production and content enrichment. In order to acquire uniquely identified entities over news wires, i.e. textual data, and integrate the resulting KB in the Linked Data framework, a series of data models need to be aligned: Web data resources a...

Knowledge Exploration using Tables on the Web

The increasing popularity of mobile device usage has ushered in many features in modern search engines that help users with various information needs. One of those needs is Knowledge Exploration, where related documents are returned in response to a user query, either directly through right-hand side knowledge panels or indirectly through navigable sections underneath individual search results....

Matching Web Tables with Knowledge Base Entities: From Entity Lookups to Entity Embeddings

Web tables constitute valuable sources of information for various applications, ranging from Web search to Knowledge Base (KB) augmentation. An underlying common requirement is to annotate the rows of Web tables with semantically rich descriptions of entities published in Web KBs. In this paper, we evaluate three unsupervised annotation methods: (a) a lookup-based method which relies on the min...

Journal

Journal title: Mathematics

Year: 2023

ISSN: 2227-7390

DOI: https://doi.org/10.3390/math11030560